
    Marathon: An open source software library for the analysis of Markov-Chain Monte Carlo algorithms

    In this paper, we consider the Markov-Chain Monte Carlo (MCMC) approach for random sampling of combinatorial objects. The running time of such an algorithm depends on the total mixing time of the underlying Markov chain and is unknown in general. For some Markov chains, upper bounds on this total mixing time exist but are too large to be applicable in practice. We try to answer the question of whether the total mixing time is close to its upper bounds or whether there is a significant gap between them. In doing so, we present the software library marathon, which is designed to support the analysis of MCMC-based sampling algorithms. The main application of this library is to compute properties of so-called state graphs, which represent the structure of Markov chains. We use marathon to investigate the quality of several bounding methods on four well-known Markov chains for sampling perfect matchings and bipartite graph realizations. In a set of experiments, we compute the total mixing time and several of its bounds for a large number of input instances. We find that the upper bound obtained by the famous canonical path method is several orders of magnitude larger than the total mixing time and deteriorates with growing input size. In contrast, the spectral bound is found to be a precise approximation of the total mixing time.
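    The two quantities being compared can be illustrated on a toy chain. The following is a minimal Python sketch, not the marathon API (marathon itself is a C++/CUDA library); the threshold eps = 1/4, the brute-force computation and the lazy walk on a 4-cycle are assumptions chosen for illustration only.

```python
# A minimal sketch (not the marathon API): total mixing time and a lower
# spectral bound for a small transition matrix P with stationary
# distribution pi. The threshold eps = 1/4 is a common convention and an
# assumption here.
import numpy as np

def total_variation(p, q):
    # Total variation distance between two probability vectors.
    return 0.5 * np.abs(p - q).sum()

def total_mixing_time(P, pi, eps=0.25, t_max=10_000):
    # Smallest t such that max_x ||P^t(x, .) - pi||_TV <= eps.
    Pt = np.eye(len(pi))
    for t in range(1, t_max + 1):
        Pt = Pt @ P
        if max(total_variation(Pt[x], pi) for x in range(len(pi))) <= eps:
            return t
    raise RuntimeError("chain did not mix within t_max steps")

def lower_spectral_bound(P, eps=0.25):
    # Standard lower bound (1/(1 - lambda*) - 1) * ln(1/(2*eps)), where
    # lambda* is the second-largest eigenvalue modulus of P.
    moduli = np.sort(np.abs(np.linalg.eigvals(P)))[::-1]
    lam_star = moduli[1]
    return (1.0 / (1.0 - lam_star) - 1.0) * np.log(1.0 / (2.0 * eps))

# Example: lazy random walk on a cycle with four states.
P = np.array([[0.50, 0.25, 0.00, 0.25],
              [0.25, 0.50, 0.25, 0.00],
              [0.00, 0.25, 0.50, 0.25],
              [0.25, 0.00, 0.25, 0.50]])
pi = np.full(4, 0.25)
print(total_mixing_time(P, pi), lower_spectral_bound(P))
```

    For a four-state chain this brute-force approach is trivial; the same computation on state graphs with many thousands of states is what motivates dedicated tooling and GPU support, as in the timing experiments reported further below.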

    Gerbil: A Fast and Memory-Efficient k-mer Counter with GPU-Support

    A basic task in bioinformatics is the counting of k-mers in genome strings. The k-mer counting problem is to build a histogram of all substrings of length k in a given genome sequence. We present the open source k-mer counting software Gerbil that has been designed for the efficient counting of k-mers for k ≥ 32. Given the technology trend towards long reads of next-generation sequencers, support for large k becomes increasingly important. While existing k-mer counting tools suffer from excessive memory resource consumption or degrading performance for large k, Gerbil is able to efficiently support large k without much loss of performance. Our software implements a two-disk approach. In the first step, DNA reads are loaded from disk and distributed to temporary files that are stored on a working disk. In a second step, the temporary files are read again, split into k-mers and counted via a hash table approach. In addition, Gerbil can optionally use GPUs to accelerate the counting step. For large k, we outperform state-of-the-art open source k-mer counting tools for large genome data sets. Comment: A short version of this paper will appear in the proceedings of WABI 201
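    The counting step itself reduces to a hash-table histogram, which the sketch below illustrates under strong simplifying assumptions (a single in-memory string, naive handling of ambiguous bases); Gerbil's two-disk pipeline, temporary files on a working disk and optional GPU acceleration are not reproduced here.

```python
# A minimal sketch of hash-table based k-mer counting; the in-memory string
# and the skipping of k-mers containing 'N' are simplifying assumptions.
from collections import Counter

def count_kmers(sequence: str, k: int) -> Counter:
    # Build a histogram of all substrings of length k.
    seq = sequence.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if "N" not in kmer:  # skip k-mers containing ambiguous bases
            counts[kmer] += 1
    return counts

print(count_kmers("ACGTACGTAC", k=4).most_common(3))
```

    One reason large k is harder for counting tools: with the usual 2-bit encoding per base, a k-mer fits into a single 64-bit machine word only for k ≤ 32, so longer k-mers need wider keys and more memory per hash-table entry.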

    Timing of Train Disposition: Towards Early Passenger Rerouting in Case of Delays

    Passenger-friendly train disposition is a challenging, highly complex online optimization problem with uncertain and incomplete information about future delays. In this paper we focus on the timing within the disposition process. We introduce three different classification schemes to predict as early as possible the status of a transfer: whether it will almost surely break, is so critically delayed that it requires manual disposition, or can be regarded as only slightly uncertain or as being safe. The three approaches use lower bounds on travel times, historical distributions of delay data, and fuzzy logic, respectively. In experiments with real delay data we achieve an excellent classification rate. Furthermore, using realistic passenger flows we observe that there is a significant potential to reduce the passenger delay if an early rerouting strategy is applied.
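    The first of the three schemes, based on lower bounds on travel times, can be sketched as follows. The field names, the waiting-time rule and the three-way split are illustrative assumptions for this sketch, not the paper's exact classification rules; the key observation is that a lower bound on the feeder train's arrival time can prove that a transfer breaks, but never that it is safe.

```python
# A sketch of the lower-bound idea for classifying a transfer. All names and
# thresholds are illustrative assumptions.
from dataclasses import dataclass

@dataclass
class Transfer:
    scheduled_departure: float  # departure time of the connecting train (min)
    earliest_arrival: float     # lower bound on the feeder's arrival time (min)
    min_transfer_time: float    # minimum time needed to change trains (min)
    max_wait: float             # longest the connecting train may be held (min)

def classify(t: Transfer) -> str:
    # Earliest moment at which the passenger could board the connecting train.
    earliest_ready = t.earliest_arrival + t.min_transfer_time
    if earliest_ready > t.scheduled_departure + t.max_wait:
        return "broken"    # missed even under the optimistic lower bound
    if earliest_ready > t.scheduled_departure:
        return "critical"  # reachable only if the connecting train is held
    return "uncertain"     # a lower bound alone cannot prove the transfer safe

# A feeder that, at best, arrives 3 minutes before a departure that can be
# held for at most 3 minutes, with a 5-minute transfer time: 'critical'.
print(classify(Transfer(scheduled_departure=60.0, earliest_arrival=57.0,
                        min_transfer_time=5.0, max_wait=3.0)))
```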

    Increased bioavailability of phenolic acids and enhanced vascular function following intake of feruloyl esterase-processed high fibre bread: a randomized, controlled, single blind, crossover human intervention trial

    Background & aims Clinical trial data have indicated an association between wholegrain consumption and a reduction in surrogate markers of cardiovascular disease. Phenolics present in wholegrain bound to arabinoxylan fibre may contribute these effects, particularly when released enzymatically from the fiber prior to ingestion. The aim of the present study was therefore to determine whether the intake of high fibre bread containing higher free ferulic acid (FA) levels (enzymatically released during processing) enhances human endothelium-dependent vascular function. Methods A randomized, single masked, controlled, crossover, human intervention study was conducted on 19 healthy men. Individuals consumed either a high fibre flatbread with enzymatically released free FA (14.22 mg), an equivalent standard high fibre bread (2.34 mg), or a white bread control (0.48 mg) and markers of vascular function and plasma phenolic acid concentrations were measured at baseline, 2, 5 and 7 h post consumption. Results Significantly increased brachial arterial dilation was observed following consumption of the high free FA (‘enzyme-treated’) high fibre bread verses both a white bread (2 h: p 0.05). Conclusion Dietary intake of bread, processed enzymatically to release FA from arabinoxylan fiber during production increases the bioavailability of FA, and induces acute endothelium-dependent vasodilation. Clinical trial registry: No NCT03946293. Website www.clinicaltrials.gov

    The quality of the upper bounds for rapidly mixing instances.

    <p>The results of <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0147935#pone.0147935.g003" target="_blank">Fig 3</a> filtered to highlight instances with known polynomial mixing time. Instances with no known polynomial bound are coloured gray.</p

    Influence of the average vertex degree.

    <p>Connection between average vertex degree of a state graph and its total mixing time, respectively canonical path bound.</p

    Single and double precision performance of the total mixing time computation.

    <p>The charts show the running time for the computation of the total mixing time on the example of five state graphs of size 8012 to 20358. Due to the relatively small amount of GPU memory on our test system, only the first four (respectively two) state graphs could be processed by the GPU implementation in single precision mode (respectively double precision mode). The running times were measured on an Ubuntu 14.04 system with a Intel Xeon E3-1231, NVIDIA GeForce GTX 970 (4 GB GPU memory) and 16 GB of main memory, using <i>gcc</i> in version 4.8.4 and <i>CUDA</i> in version 7.0.</p

    Relationship between the lower spectral bound and the total mixing time.

    <p>The total mixing time is shown in connection to a corresponding lower spectral bound for sequence pairs of the form (<i>n</i> − 1, <i>n</i> − 2, 2, 1), (2, 2, …, 2). We use the displayed formulas to predict missing values for total mixing time.</p